Reordering Columns for Smaller Indexes

نویسندگان

  • Daniel Lemire
  • Owen Kaser
چکیده

Column-oriented indexes—such as projection or bitmap indexes—are compressed by run-length encoding to reduce storage and increase speed. Sorting the tables improves compression. On realistic data sets, permuting the columns in the right order before sorting can reduce the number of runs by a factor of two or more. For many cases, we prove that the number of runs in table columns is minimized if we sort columns by increasing cardinality. Yet—maybe surprisingly—we must sometimes maximize the number of runs to minimize the index size. Experimentally, sorting based on Hilbert space-filling curves is poor at minimizing the number of runs.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sorting improves word-aligned bitmap indexes

Bitmap indexes must be compressed to reduce input/output costs and minimize CPU usage. To accelerate logical operations (AND, OR, XOR) over bitmaps, we use techniques based on run-length encoding (RLE), such as Word-Aligned Hybrid (WAH) compression. These techniques are sensitive to the order of the rows: a simple lexicographical sort can divide the index size by 9 and make indexes several time...

متن کامل

Evaluation of Some Reordering Techniques for Image VQ Index Compression

Frequently, it is observed that the sequence of indexes generated by a vector quantizer (VQ) contains a high degree of correlation, and, therefore, can be further compressed using lossless data compression techniques. In this paper, we address the problem of codebook reordering regarding the compression of the image of VQ indexes by general purpose lossless image coding methods, such as JPEG-LS...

متن کامل

Efficient Reordering for Direct Methods in Analog Circuit Simulation

For linear systems of equations arising from circuit simulation, we studied the efficiency of the reordering algorithm integrated in the MA27 subroutine package from Duff and Reid. Our conclusion is that the algorithm performs extraordinarily well, provided that the nearly-dense rows and columns of the matrix have been removed prior to reordering. Thus, we recommend the reordering implementatio...

متن کامل

A new technique for the Simplex basis LU factorization update

The objective of this work is to develop more efficient alternatives for Simplex method implementation. Techniques of the Simplex basis LU factorization update are developed to improve the solution of the Simplex method linear systems to achieve a column static reordering of the matrix. A simulation of the Simplex method is implemented, with the change of basis obtained from MINOS. Only the fac...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Sci.

دوره 181  شماره 

صفحات  -

تاریخ انتشار 2011